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Abstract. The Nemhauser- Trotter local optimization theorem applies to the NP-hard 
Vertex Cover problem and has applications in approximation as well as parameterized 
algorithmics. We present a framework that generalizes Nemhauser and Trotter's result 
to vertex deletion and graph packing problems, introducing novel algorithmic strategies 
based on purely combinatorial arguments (not referring to linear programming as the 
Nemhauser- Trotter result originally did). 

We exhibit our framework using a generalization of Vertex Cover, called Bounded- 
Degree Deletion, that has promise to become an important tool in the analysis of gene 
and other biological networks. For some fixed d > 0, Bounded-Degree Deletion asks 
to delete as few vertices as possible from a graph in order to transform it into a graph 
with maximum vertex degree at most d. Vertex Cover is the special case of d = 0. Our 
generalization of the Nemhauser- Trotter theorem implies that Bounded-Degree Dele- 
tion has a problem kernel with a linear number of vertices for every constant d. We also 
outline an application of our extremal combinatorial approach to the problem of packing 
stars with a bounded number of leaves. Finally, charting the border between (param- 
eterized) tractability and intractability for Bounded-Degree Deletion, we provide a 
W[2]-hardness result for Bounded-Degree Deletion in case of unbounded d-values. 
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1. Introduction 

Nemhauser and Trotter [20] proved a famous theorem in combinatorial optimization. 
In terms of the NP-hard Vertex CoverQ problem, it can be formulated as follows: 

NT-Theorem [201 Hj. ^or an undirected graph G = {V, E) one can compute in polynomial 
time two disjoint vertex subsets A and B, such that the following three properties hold: 

(1) If S' is a vertex cover of the induced subgraph G\V\ {AyjB)], then Au S' is a vertex 
cover of G. 

(2) There is a minimum- cardinality vertex cover S of G with j4 C 5. 

(3) Every vertex cover of the induced subgraph G\V\ {A[J B)] has size at least \ V\[A[J 
B)\/2. 

In other words, the NT-Theorem provides a polynomial-time data reduction for Vertex 
Cover. That is, for vertices in A it can already be decided in polynomial time to put them 
into the solution set and vertices in B can be ignored for finding a solution. The NT- 
Theorem is very useful for approximating Vertex Cover. The point is that the search for 
an approximate solution can be restricted to the induced subgraph G[y \ [AuB)]. The NT- 
Theorem directly delivers a factor-2 approximation for Vertex Cover by choosing V\B 
as the vertex cover. Chen et al. [7] first observed that the NT- Theorem directly yields a 
2/c-vertex problem kernel for Vertex Cover, where the parameter k denotes the size of the 
solution set. Indeed, this is in a sense an "ultimate" kernelization result in parameterized 
complexity analysis [10^ [TT| [2T] because there is good reason to believe that there is a 
matching lower bound 2k for the kernel size unless P=NP |16| . 

Since its publication numerous authors have referred to the importance of the NT- 
Theorem from the viewpoint of polynomial-time approximation algorithms (e.g., [HHTj) as 
well as from the viewpoint of parameterized algorithmics (e.g., [I10I9]). The relevance of 
the NT-Theorem comes from both its practical usefulness in solving the Vertex Cover 
problem as well as its theoretical depth having led to numerous further studies and follow- 
up work [U m [9] . In this work, our main contribution is to provide a more general and more 
widely applicable version of the NT-Theorem. The corresponding algorithmic strategies and 
proof techniques, however, are not achieved by a generalization of known proofs of the NT- 
Theorem but are completely different and are based on extremal combinatorial arguments. 
Vertex Cover can be formulated as the problem of finding a minimum-cardinality set 
of vertices whose deletion makes a graph edge-free, that is, the remaining vertices have 
degree 0. Our main result is to prove a generalization of the NT-Theorem that helps in 
finding a minimum-cardinality set of vertices whose deletion leaves a graph of maximum 
degree d for arbitrary but fixed d. Clearly, d = is the special case of Vertex Cover. 

Motivation. Since the NP-hard Bounded-Degree Deletion problem — given a graph 
and two positive integers k and d, find at most k vertices whose deletion leaves a graph 
of maximum vertex degree d — stands in the center of our considerations, some more ex- 
planations about its relevance follow. Bounded-Degree Deletion (or its dual problem) 
already appears in some theoretical work, e.g., [6l[18l[22], but so far it has received consider- 
ably less attention than Vertex Cover, one of the best studied problems in combinatorial 
optimization [T7]. To advocate and justify more research on Bounded-Degree Deletion, 



^Vertex Cover is the following problem: Given an undirected graph, find a minimum-cardinality set S 
of vertices such that each edge has at least one endpoint in S. 
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we describe an application in computational biology. In the analysis of genetic networks 
based on micro-array data, recently a clique-centric approach has shown great success [3 [H] . 
Roughly speaking, finding cliques or near-cliques (called paracliques [H]) has been a central 
tool. Since finding cliques is computationally hard (also with respect to approximation), 
Chesler et al. [HI page 241] state that "cliques are identified through a transformation to 
the complementary dual Vertex Cover problem and the use of highly parallel algorithms 
based on the notion of fixed-parameter tractability." More specifically, in these Vertex 
CovER-based algorithms polynomial-time data reduction (such as the NT-Theorem) plays a 
decisive role [19] (also see [1]) for efficient solvability of the given real- world data. However, 
since biological and other real- world data typically contain errors, the demand for finding 
cliques (that is, fully connected subgraphs) often seems overly restrictive and somewhat re- 
laxed notations of cliques are more appropriate. For instance, Chesler et al. [8] introduced 
paracliques, which are achieved by greedily extending the found cliques by vertices that are 
connected to almost all (para) clique vertices. An elegant mathematical concept of "relaxed 
cliques" is that of s-plexe^ where one demands that each s-plex vertex does not need to be 
connected to all other vertices in the s-plex but to all but s — 1. Thus, cliques are 1-plexes. 
The corresponding problem to find maximum-cardinality s-plexes in a graph is basically 
as computationally hard as clique detection is [21(11]. However, as Vertex Cover is the 
dual problem for chque detection, Bounded-Degree Deletion is the dual problem for 
■s-plex detection: An n-vertex graph has an s-plex of size k iff its complement graph has a 
solution set for Bounded-Degree Deletion with d = s — 1 of size n — k, and the solution 
sets can directly be computed from each other. The Vertex Cover polynomial-time data 
reduction algorithm has played an important role in the practical success story of analyzing 
real- world genetic and other biological networks [3l|8]. Our new polynomial-time data re- 
duction algorithms for Bounded-Degree Deletion have the potential to play a similar 
role. 

Our results. Our main theorem can be formulated as follows. 

BDD-DR-Theorem (jTheorem 2|) . For an undirected n-vertex and m-edge graph G = 
{V,E), we can compute two disjoint vertex subsets A and B in 0{n^/'^ ■ m + n^) time, such 
that the following three properties hold: 

(1) If S' is a solution set /or Bounded-Degree Deletion of the induced subgraph G\V\ 
{AVJB)], then S := S' U A is a solution set for Bounded-Degree Deletion of G. 

(2) There is a minimum- cardinality solution set S for Bounded-Degree Deletion 
ofG with A(ZS. 

(3) Every solution set for Bounded-Degree Deletion of the induced subgraph G\V\ 
{A U B)] has size at least 

\V\{AiJB)\ 
d^ + W^ + Qd + A' 

In terms of parameterized algorithmics, this gives a {d^ -\- 'id?' -\- Qd -\- A) • A;- vertex 
problem kernel for Bounded-Degree Deletion, which is linear in k for constant d- 
values, thus joining a number of other recent "linear kernelization results" [Sj [T2| [TH I15j . 
Our general result specializes to a 4/c-vertex problem kernel for Vertex Cover (the NT- 
Theorem provides a size-2A; problem kernel), but applies to a larger class of problems. 



Introduced in 1978 by Seidman and Foster [24] in the context of social network analysis. Recently, this 
concept has again found increased interest [^118). 
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For instance, a slightly modified version of the BDD-DR-Theorem (with essentially the 
same proof) yields a 15fc-vertex problem kernel for the problem of packing at least k vertex- 
disjoint length-2 paths of an input graph, giving the same bound as shown in work focussing 
on this problem |23jl^ For the problem, where, given an undirected graph, one seeks a 
set of at least k vertex-disjoint star^ of the same constant size, we show that a kernel 
with a linear number of vertices can be achieved, improving the best previous quadratic 
kernelization [23]. We emphasize that our data reduction technique is based on extremal 
combinatorial arguments; the resulting combinatorial kernelization algorithm has practical 
potential and implementation work is underway. Note that for d = our algorithm computes 
the same type of structure as in the "crown decomposition" kernelization for Vertex 
Cover (see, for example, [T]). However, for d > 1 the structure returned by our algorithm 
is much more complicated; in particular, unlike for Vertex Cover crown decompositions, 
in the BDD-DR-Theorem the set A is not necessarily a separator and the set B does not 
necessarily form an independent set. 

Exploring the borders of parameterized tractability of Bounded-Degree Deletion 
for arbitrary values of the degree value d, we show the following. 

Theorem 1. For unbounded d (given as part of the input), Bounded-Degree Deletion 
is [2] -complete with respect to the parameter k denoting the number of vertices to delete. 

In other words, there is no hope for fixed-parameter tractability with respect to the 
parameter k in the case of unbounded d- values. Due to the lack of space the proof of 
ITheorem II and several proofs of lemmas needed to show [Theorem 21 are omitted. 

2. Preliminaries 

A bdd-d-set for a graph G = {V, E) is a vertex subset whose removal from G yields a 
graph in which each vertex has degree at most d. The central problem of this paper is 
Bounded-Degree Deletion 

Input: An undirected graph G = (V^E), and integers d > and /c > 0. 
Question: Does there exist a bdd-(i-set /S C y of size at most k for G? 

In this paper, for a graph G = iV, E) and a vertex set 5 C y, let G[S] be the subgraph 
of G induced by S and G — S := G[V \ S]. The open neighborhood of a vertex v or a vertex 
set 5 C y in a graph G = {V, E) is denoted as Ng{v) := {n e F | {«, v] G E} and Ng{S) := 
^v&s ^g{v) \ S, respectively. The closed neighborhood is denoted as A^cb] := Ng{v) U {v} 
and Ng[S] := Ng{S) U S. We write V{G) and E{G) to denote the vertex and edge set of G, 
respectively. A packing P of a graph G is a set of pairwise vertex-disjoint subgraphs of G. 
A graph has maximum degree d when every vertex in the graph has degree at most d. A 
graph property is called hereditary if every induced subgraph of a graph with this property 
has the property as well. 

Parameterized algorithmics [TT\ [5T] is an approach to finding optimal solutions 
for NP-hard problems. A common method in parameterized algorithmics is to provide 
polynomial-time executable data reduction rules that lead to a problem kernel |T3j. This is 
the most important concept for this paper. Given a parameterized problem instance (/, k), a 

^Very recently, Wang et al. [55] improved the 15fc-bound to a 7fc-bound. We claim that our kernelization 
based on the BDD-DR-Theorem method can be easily adapted to also deliver the 7fc-bound. 
star is a tree where all of the vertices but one are leaves. 



A GENERALIZATION OF NEMHAUSER AND TROTTER'S LOCAL OPTIMIZATION THEOREM 413 



data reduction rule replaces (/, k) by an instance (/', k') in polynomial time such that |/'| < 
|/|, k' < k, and {I,k) is a Yes-instance if and only if {I',k') is a Yes-instance. A param- 
eterized problem is said to have a problem kernel, or, equivalently, kernelization, if, after 
the exhaustive application of the data reduction rules, the resulting reduced instance has 
size f{k) for a function / depending only on k. Roughly speaking, the kernel size f{k) plays 
a similar role in the subject of problem kernelization as the approximation factor plays for 
approximation algorithms. 



3. A Local Optimization Algorithm for Bounded-Degree Deletion 

The main result of this section is the following generalization of the Nemhauser- Trotter- 
Theorem [20] for Bounded-Degree Deletion with constant d. 



Theorem 2 (BDD-DR-Theorem). For an n- vertex and m-edge graph G = {V,E), we can 
compute two disjoint vertex subsets A and B in 0(n^/^-m+n^) time, such that the following 
three properties hold: 

(1) If S' is a bdd-d-set of G - (A U B), then 5 := 5' U A is a bdd-d-set of G. 

(2) There is a minimum-cardinality bdd-d-set S of G with ^4 C S". 

(3) Every bdd-d-set oi G — {AU B) has size at least . 

This first two properties are called the local optimality conditions. The remainder of this 
section is dedicated to the proof of this theorem. More specifically, we present an algorithm 
called compute_AB (see [Figure 1 ) which outputs two sets A and B fulfilling the three 



properties given in [Theorem 2[ The core of this algorithm is the procedure find_extremal 
(see Figure 2) running in 0(n^/^ • m + n^) time. This procedure returns two disjoint vertex 



subsets C and D that, among others, satisfy the local optimality conditions. The procedure 
is iteratively called by compute_AB. The overall output sets A and B then are the union 
of the outputs of all applications of find_extremal. Actually, find_extremal searches 
for Ccy, Dcy, GnD = satisfying the following two conditions: 
CI Each vertex in Ng[D] \ G has degree at most d in G — C, and 
C2 C is a minimum-cardinality bdd-d-set for G[G U D]. 

It is not hard to see that these two conditions are stronger than the local optimality condi- 
tions of [Theorem 21 

Lemma 1. Let G and D be two vertex subsets satisfying conditions C[T]and C[2j Then, the 
following is true: 

(1) If S' is a bdd-d-set of G - (C U then S := S' U G is a bdd-d-set of G. 

(2) There is a minimum-cardinality bdd-d-set S* of G with G C S. 

ILemmaT] will be used in the proof of [Theorem 21 — it helps to make the description of 
the underlying algorithm and the corresponding correctness proofs more accessible. As a 
direct application of lTheorem 2[ we get the following corollary. 

Corollary 1. Bounded-Degree Deletion with constant d admits a problem kernel with 
at most (d^ + 4(i^ + 6d + 4) ■ k vertices, which is computable in 0{n^/'^ ■ m + n^) time. 

We use the following easy-to-verify forbidden subgraph characterization of bounded- 
degree graphs: A graph G has maximum degree d if and only if there is no "(d -|- l)-star" 
in G. 
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Algorithm: compute_AB (G) 

Input: An undirected grapli G. 

Output: Vertex subsets A and B satisfying the three properties of lTheorem 2[ 

1 A := %,B:= 

2 Compute a witness X and the corresponding residual Y ■.= V\X\or G 

3 If |y| <{d + if ■ \X\ then return {A, B) 

4 {G,D) ^ find_extremal {G,X,Y). 

5 G ^ G - {C {J D)] A ^ Au G]B ^ B D; goto lineE 



Figure 1: Pseudo-code of the main algorithm for computing A and B. 



Definition 3.1. For s > 1, the graph Ki s = ({n, ui, . . . , Vg}, {{u, vi}, . . . , {u, Vs}}) is called 
an s-star. The vertex u is called the center of the star. The vertices vi, . . . ,Vs are the leaves 
of the star. A <s-star is an s'-star with s' < s. 

Due to this forbidden subgraph characterization of bounded-degree graphs, we can also 
derive a linear kernelization for the {d+ 1)-Star Packing problem. In this problem, given 
an undirected graph, one seeks for at least k vertex-disjoint {d -\- l)-stars for a constant d. 
With a slight modification of the proof of lTheorem 2\ we get the following corollary. 

Corollary 2. {d + 1)-Star Packing admits a problem kernel with at most (d^ -t- M'^ + 
6d + 4) ■ k vertices, which is computable in 0{n^^'^ ■ m + nP) time. 

For d > 2, the best known kernelization result was a 0{k^) kernel [23]. Note that the 
special case of {d + 1)-Star Packing with d = 1 is also called P3-PACKING, a problem 



well-studied in the literature, see [231 I25j . Corollary 2 gives a 15A:- vertex problem ker 



nel. The best-known bound is 7k [25]. However, the improvement from the formerly best 
bound 15k [23] is achieved by improving a properly defined witness structure by local mod- 
ifications. This trick also works with our approach, that is, we can show that the NT-like 
approach also yields a T/c-vertex problem kernel for 2-Star Packing. 

3.1. The Algorithm 

We start with an informal description of the algorithm. As stated in the introduction 



of this section, the central part is Algorithm compute_AB shown in Figure 1 

Using the characterization of bounded-degree graphs by forbidding large stars, in line [2] 
compute_AB starts with computing two vertex sets X and Y: First, with a straightforward 
greedy algorithm, compute a maximal {d + l)-star packing of G, that is, a set of vertex- 
disjoint {d+ l)-stars that cannot be extended by adding another {d+ l)-star. Let X be the 
set of vertices of the star packing. Since the number of stars in the packing is a lower bound 
for the size of a minimum bdd-d-set, A is a factor-(d-|- 2) approximate bdd-d-set. Greedily 
remove vertices from X such that X is still a bdd-d-set, and finally set Y := V \ X. We 
call X the witness and Y the corresponding residual. 

If the residual Y is too big (condition in line[3|), the sets X and Y are passed in lineH] 
to the procedure find_extremal in [Figure 2 which computes two sets C and D satisfying 



conditions C|T] and CEl Computing X and Y represents the first step to find a subset pair 
satisfying condition C[TJ Since there is no vertex that has degree more than d in G — X (due 
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Procedure: find_extremal {G,X,Y) 

Input: An undirected graph G, witness X, and residual Y. 

Output: Vertex subsets C and D satisfying the local optimality conditions. 

1 J ^ bipartite grapli witli X and Y as its two vertex subsets and 

E{J) ^ {{u, v} G E{G) \ u£X andvGY} 

> Initialize empty set of forbidden vertices 
start witli i = and while ^ X do > Loop while not all vertices in X are forbidden 
fJ ^ NG[Nj{Ff)\ \X > Determine forbidden vertices in Y 

P ^ star-packing (J - (Ff u Fj), X \ Ff, Y \ Fj , d) 

Do^Y \ {fJ u y(P)) i> Vertices in Y that are not forbidden and not in P 

Start search for C, D satisfying (JB 



2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 



^0 



start witli i = and repeat 

Ci ^ Nj{Di) 
A+i ^ Np{Ci) u A 

untii A = A-i 
G ^ C^, D ^ Di 
\tC = X\Ff then 
return (C, D) 

x\c 

J ^ i + 1 

17 end wliiie 

18 Fj^ ^NG[Nj{Ff)]\X 

19 return (0, F \ (X u F/')) 



> C,D also satisfy CQ 



> Determine forbidden vertices in X for next iteration 



> Recompute forbidden vertices in Y (as in line\^ 



Procedure: star-packing {J,Vi,V2,d) 

Input: A bipartite graph J with two vertex subsets Vi and V2- 

Output: A maximum-edge packing of stars that have their centers in Vi and have at 
most d + 1 leaves in V2 ■ 

See ILemma"2t the straightforward implementation details using matching techniques are 
omitted. 



Figure 2: Pseudo-code of the procedure computing the intermediary vertex subset 
pair (C, D). 



to the fact that X is a bdd-d-set), the search is limited to those subset pairs where C is a 
subset of the witness X and D is a subset of Y. 

Algorithm compute_AB calls find_extremal iteratively until the sets A and B, which 
are constructed by the union of the outputs of all applications of find_extremal (see line [5]), 
satisfy the third property in Theorem [2j In the following, we intuitively describe the basic 
ideas behind find_extremal. 

To construct the set C from X, we compute again a star packing P with the centers 
of the stars being from X and the leaves being from Y. We relax, on the one hand, the 
requirement that the stars in the packing have exactly d + 1 leaves, that is, the packing P 
might contain < d-stars. On the other hand, P should have a maximum number of edges. 
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The rough idea behind the requirement for a maximum number of edges is to maximize the 
number of (d+ l)-stars in P in the course of the algorithm. Moreover, we can observe that, 
by setting C equal to the center set of the {d + l)-stars in P and D equal to the leaf set 
of the {d + l)-stars in P, C is a minimum bdd-d-set of G[C U D] (condition C^. We call 
such a packing a maximum- edge X-center < {d + l)-star packing. For computing P, the 
algorithm constructs an auxiliary bipartite graph J with X as one vertex subset and Y as 
the other. The edge set of J consists of the edges in G with exactly one endpoint in X. 



See line [T] of Figure 2 Obviously, a maximum-edge X-center < {d + l)-star packing of G 
corresponds one-to-one with a maximum-edge packing of stars in J that have their centers 
in X and have at most d + 1 leaves in the other vertex subset. Then, the star packing P can 
be computed by using techniques for computing maximum matchings in J (in the following, 
let star-packing(J,Vi,V2)rf) denote an algorithm that computes a maximum-edge Vi-center 
< (d + l)-star packing P on the bipartite graph J). 

The most involved part of find_extremal in [Figured is to guarantee that the output 



subsets in line H] fulfill condition C[TJ To this end, one uses an iterative approach to compute 
the star packing P. Roughly speaking, in each iteration, if the subsets C and D do not 
fulfill condition Cdl then exclude from further iterations the vertices from D that themselves 



or whose neighbors violate this condition. See lines [2] to [l5] of Figure 2 for more details of 
the iterative computation. Herein, for j > 0, the sets Fj^ C X and C Y, where Fj^ is 
initialized with the empty set, and FJ" is computed using Fj^ , store the vertices excluded 
from computing P. To find the vertices that themselves cause the violation of the condition, 
that is, vertices in D that have neighbors in X\C, one uses an augmenting path computation 
in lineslTltollllto get in line[T2]subsets G and D such that the vertices in D do not themselves 
violate the condition. Roughly speaking, the existence of an edge e from some vertex in D 
to some vertex in X \ C would imply that the < (d + l)-star packing is not maximum 
(witnessed by an augmenting path beginning with e — in principle, this idea is also used for 
finding crown decompositions, cf. [1]). The vertices whose neighbors cause the violation of 
condition CU are all vertices in D with neighbors mY\D that themselves have neighbors 
in X \ G. These neighbors in y \ D and the corresponding vertices in D are excluded in 
line [Hand line[l8l We will see that the number of all excluded vertices is 0{\X \ G\), thus, 
in total, we do not exclude too many vertices with this iterative method. The formal proof 
of correctness is given in the following subsection. 

3.2. Running Time and Correctness 



Now, we show that compute_AB in Figure 1 computes in the claimed time two vertex 



subsets A and B that fulfill the three properties given in [Theorem 21 
3.2.1. Running Time of find_extremal. We begin with the proof of the running time of 



the procedure find_extremal in Figure 2] which uses the following lemmas. 



Lemma 2. Procedure star-packing( J, Vi, V2,d) in Figure 2 runs in 0{\/n ■ m) time. 



The next lemma is also used for the correctness proof; in particular, it guarantees the 
termination of the algorithm. 



Lemma 3. If the condition in line 1131 of Figure 2 is false for a j > 0, then F^^ C F^^^. 
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Proof. In lines [J] and [5] of Figure 2, all vertices in Fj^ and their neighbors Nj{Fj^) are 
excluded from the star packing P in the jth. iteration of the outer loop. Moreover, the 
vertices in Nj{F^) are excluded from the set Dq (lineE]). Therefore, a vertex in F^ cannot 
be added to C in line [IS Thus -Pj'^i (set to X \ C in line fTS]) contains F^ . Moreover, this 
containment is proper, as otherwise the condition in line [13] would be true. ■ 

Lemma 4. Procedure find_extremal runs in 0{n^/'^ ■ m + v?) time. 



3.2.2. Correctness of find_extremal. The correctness proof for find_extremal in Figure 2 
is more involved than its running time analysis. The following lemmas provide some prop- 
erties of (C, D) which are needed. 

Lemma 5. For each j > the following properties hold after the execution of line [T2] 
in [Figure 2 



(1) every vertex in C is a center vertex of a {d+ l)-star in P, and 

(2) the leaves of every star in P with center in C are vertices in D. 

Proof. (Sketch) To prove ([T|), first of all, we show that v £ C implies v G V{P), since, oth- 
erwise, we could get a P-augmenting path from some element in Dq to v. A P-augmenting 
path is a path where the edges in E{P) and the edges not in E[P) alternate, and the 
first and the last edge are not in E{P). This P-augmenting path can be constructed in an 
inductive way by simulating the construction of Ci in lines (61 to 1111 of [Figure 2 From this 



P-augmenting path, we can then construct a X-center < {d+ l)-star packing that has more 
edges than P, contradicting that E[P) has maximum cardinality. Second, every vertex 
in C is a center of a star due to the definition of P and Procedure star-packing. Finally, 
if a vertex v € C is the center of a star with less than {d 1) leaves, then again we get a 
P-augmenting path from some element in Dq to v. 

The second statement follows easily from Procedure star-packing and the pseudo-code 
in lines [6] to [l2j ■ 

Lemma 6. For each j > there is no edge in G between D and Nj{Fj^). 

Proof. The vertices in F^^ and the vertices in NciNj^Ff)] \ X are excluded from the 
computation of P and are not contained in Dq (lines |H to [6| in [Figure 3). Thus, Nj[Ff]r\D = 



and therefore there are no edges in G between D and Nj{Ff). 

The next lemma shows that the output of find_extremal fulfills the local optimality 
conditions. 

Lemma 7. Procedure find_extremal returns two disjoint vertex subsets fulfilling condi- 
tions cm and CEl 

Proof. Clearly, the output consists of two disjoints sets. The algorithm returns in lines [TH 
or [ini of [Figure 2 If it returns in line [191 then the output C is empty and D contains 



only vertices that have a distance at least 3 to the vertices in X: The condition in line [3| 
implies Fj^ = X and, therefore, FJ" contains all vertices in G \ X that have distance at 
most 2 to the vertices in X. Since X is a bdd-d-set of G, all vertices in D and their 
neighbors in G have a degree at most d. This implies that both conditions hold for the 
output returned in this line. It remains to consider the output returned in line [TH 
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To show that condition C[T]holds, recall that G—X has maximum degree d and that C C 
X. Therefore, if for a vertex v m.V \ X we have Nj{v) C C, then v has degree at most d 
m G — C. Thus, to show that each vertex in Nc[D] \ G has degree at most d in G — G, 
it suffices to prove that Nj{Ng[D] \G) C G. We show separately that Nj{D) C G and 
that NjiNaiD) \ C) C C. 

The assignment in line Eland the until-condition in line [TT] directly give Nj(D) C G. 
Due to ILemma 61 there is no edge in G between D and Nj{F^), where FJ^ = X \ G (the 
if-condition in line [THl which has to be satisfied for the procedure to return in line I14p . 
From this it follows that the vertices in Ng{D) \ G have no vertex in Fj^ as neighbor and, 
thus, NjiNciD) \ C) n Ff = 0. Therefore, Nj{Ng{D) \ C) C C. 

By Properties [T] and [2l of [Lemma 51 there are exactly \G\ many vertex-disjoint {d + 1)- 
stars in G[G U D]. Moreover, there is no {d + l)-star in (^[-D], since X is a bdd-d-set of G. 
Thus, C is a minimum-cardinality bdd-d-set of G[G L) D]. m 



3.2.3. Running Time and Correctness of compute_AB. To prove the running time and 
correctness of compute_AB, we have to show that the output of find_extremal contains 
sufficiently many vertices of Y. To this end, the following lemma plays a decisive role. 

Lemma 8. For all j > 0, the set FJ' in line [Hand line 1181 of Figure~2] has size at most {d + 

Proof. The proof is by induction on j. The claim trivially holds for j = 0, since Fq = 0. 
Assume that the claim is true for j > 0. Since F^^ C F^^-^ (jLemma 3p . we have 

Fjl, = Fj\J NG-xWj^^HFf^, \ if)]. 

We first bound the size of Nj_pY (if i \ if)- Since if i was set to A \ C at the end of the 

jth iteration of the outer loop (line fTS]) . the vertices in Nj_pY{F:j^i\Ff) were not excluded 

from computing the packing P (line [5]) of the jth iteration. Moreover, Aj_pi'(if \if ) C 

V{P) for the star packing P computed in the jth iteration, since, otherwise, the set Dq in 
lineOwould contain a vertex v in N j_py{FJ^^ and, then, line[S]would include Nj{v) 

into C, which would contradict the fact that G n if i = (line I15|) . Due to property [2] 
in ILemma 51 the leaves of every star in P with center in G are vertices in D and, thus, the 
vertices in N j_pY{Fj^-^ \^f') leaves of stars in P with centers in if i V-^/"- Since each 

star has at most (d+l) leaves, the set Nj_pY{Fj^^\Fj^) has size at most (d-|-l)-|F^^-|^ \ if |. 

The remaining part is easy to bound: since all the vertices mV\X have degree at most d, 
we get 

'NG-x[Nj_pY{Ff_,, \ Ff)] I < (d . (d + 1) + (d + 1)) . \Ff^, \ if I 

= {d + lf-\Ff_,,\Ff\. 
With the induction hypothesis, we get that 

|if il < \Fl\ + \NG-x[Nj_pY{Ffi, \ if )]| 

= {d+ 1)2 . |if I + {d+ 1)2 . \Ff^, \ Ff\ = {d+ 1)2 . |if ,|. 
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Lemma 9. Procedure find_extremal always finds two sets C and D such that |y \ D| < 
{d+lf-\X\C\. 

Proof. If find_extremal terminates, then V' = Fj^ U FJ' for the graph G' = {V, E') 
resulting by removing C U D from G. Since C C X and D C y, we have X \ C = Fj^ 
andY\D = FJ , and bv lLemma Sl it follows immediately that \Y \D\ < {d+lf ■\X\C\.u 

Therefore, if |y| > {d + 1)^ • then find_extremal always returns two sets C and D 
such that D is not empty. 

Lemma 10. Algorithm compute_AB runs in 0{n^^'^ ■ m + n^) time. 

Lemma 11. The sets A and B computed by compute_AB fulfill the three properties 
given in [Theorem 2[ 



Proof. Since every {G,D) output by find_extremal in lined] of compute_AB in Figure 1 



fulfills conditions C[T]and (jLemma 7p . the pair {A,B) output in line [3] of compute_AB 
fulfills conditions C[T]and and, therefore, also the local optimality conditions (jLemma ip . 
It remains to show that {A, B) fulfills the size condition. 

Let X and Y be the last computed witness and residual, respectively. Since the condi- 
tion in line [3] is true, we know that \Y\ < {d + 1)^ • \X\. Recall that X is a factor-((i + 2) 
approximate bdd-(i-set for G' := G — {Au B). Thus, every bdd-d-set of G' has size at 
least \X\/{d + 2). Since the output sets A and B fulfill the local optimality conditions and 
the bounded-degree property is hereditary, every bdd-d-set of G' has size at least 

\x\ (*) \V'\ \V'\ 

> 



d + 2 - (d + 2)((d + 1)2 + 1) + 4^2 + 6d + 4) ■ 

The inequality (*) follows from the fact that Y is small, that is, |y| < {d + 1)2 • \X\ (note 

that y = xuy). ■ 

With Lemmas 1101 and II H the proof of [Theorem 21 is completed. 



4. Conclusion 

Our main result is to generalize the Nemhauser- Trotter-Theorem, which applies to 
the Bounded-Degree Deletion problem with d = (that is. Vertex Cover), to 
the general case with arbitrary d > 0. In particular, in this way we contribute problem 
kernels with a number of vertices linear in the solution size k for all constant values of d 
for Bounded-Degree Deletion. To this end, we developed a new algorithmic strategy 
that is based on extremal combinatorial arguments. The original NT-Theorem PU] has 
been proven using linear programming relaxations — we see no way how this could have 
been generalized to Bounded-Degree Deletion. By way of contrast, we presented a 
purely combinatorial data reduction algorithm which is also completely different from known 
combinatorial data reduction algorithms for Vertex Cover (see [UlllE])- Finally, Baldwin 
et al. [31 page 175] remarked that, with respect to practical applicability in the case of 
Vertex Cover kernelization, combinatorial data reduction algorithms are more powerful 
than "slower methods that rely on linear programming relaxation". Hence, we expect 
that benefits similar to those derived from Vertex Cover kernelization for biological 
network analysis (see the motivation part of our introductory discussion) may be provided 
by Bounded-Degree Deletion kernehzation. 
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